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Ivo distinct analytical techniques were used to develop 
ttodels to forecast the numter of high quality non^rior 
service sales enlisting monthly in the U.S. Navy on a 
recruiting district level. The Box-Jenkins methodology vas 
applied to a monthly time series of enlistments for the 
period Octcter 1978 tc September 1983. A multiple regression 
causal model was developed based on the explanitory vari- 
ables; numbers of unemployed, change in the rate of unem- 
ploysert and military/civilian pay ratio. A combined time 
ser ies/causal model vas developed by applying the 
Eox-Jerkins technique to the residuals of the multiple 
regression. These models were compared for predictive 
validity, fiecommendations fcr further development of models 
containing explicit time series elements are presented. 
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i. EiCSlEH 

Curreot military policy includes an expansion of the 
Navj fica the present level cf approximately 500 ships to 
apprcximately 600 ships by the end of the decade. Ibis 
expansion is being conducted at a time when the primary 
recruiting pool# 17 - 21 year old males# is declining. Ihere 
are tvo avenues which must be pursued to prevent a short 
fall in personnel tc man the expanding fleet. First# the 
Navy iiust retain high quality personnel and secondly, the 
Navy must recruit sufficient numbers of high quality 
perscnnel tc replace attritions and simultaneously meet ever 
increasiig manpower requirements. 

It is not enough just to recruit warm todies tc fill 
vacancies. The ability cf the American serviceman to 

adequately operate and maintain equipment of increasing 
sophistication has been widely debated in Congress and the 
press. Stcries abound of the expected deterioration cf 
equipment readiness during periods of increased tensicns in 
the event of the loss of civilian personnel. To maintain an 
effective fcrce# the Armed Services must be able to operate 
and maintain its* equipment independent of nonccmbatant 
civilian technicians. In order to achieve this goal# 
perscnnel with the ability to learn the operation and main- 
tenance of ever mere sophisticated equipment must be 
obtained and retained. 

Cne avenue proposed to alleviate a manpower shortfall is 
an expanded role for women in the Armed Forces. This would 
seem# cn the surface# to be a viable solution to the problem 
since women have been historically less expensive to recruit 



and retain than aen of comparable quality [Bef. 1]. 
However, the Navy is precluded from taking full advantage of 
this source of ttanpcvet due to public law precluding the 
permanent assignment of women to combatant ships. This 
requires focusing the recruiting effort towards high quality 
men. The definition of "high quality" has been widely 
debated. For the purposes of this study, "high quality" will 
be defined as high school graduates in Mental Groups I 
through III upper as measured by test scores on the Armed 
Forces Vocational Battery of tests (ASVAB) , 

E. ICCCS 

This thesis is concerned with developing a model to 
forecast the number of high quality nonprior service males 
signing contracts on the recruiting district level. The 
dependent variable is net contracts signed: CNTBCTS. Values 
of this variable are determined by subtracting Delayed Entry 
Frogram (DBF) attrites during the month of attrition from 
contracts signed during the same month. 

Three approaches to forecasting CNTBCTS are taken. 
First, the Box- Jenkins methodology is applied to a mcrthly 
time series of the net number of nonprior service high 
school graduate males signing contracts in the Albuquerque 
Hew Cexico recruiting district during the period October 
1978 to September 1983 (60 data points). This methodology is 
explained in detail in Appendix A. The second approach 
utilizes "best fit" lagged variables in a block regression 
format to provide a causal model using economic data, the 
number of production recruiters and military/civilian pay 
ratio as explanitory variables. The third procedure 

combines the causal model and the Box-Jenkins methodology in 
a mixed time series/causal model to attempt to provide more 
accurate forecasts. 
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C. 1E£ IBVIBONHfHT 



lUs tb€sis will limit the scope of study to a single 
recruiting district. It is hypothesized that better esti- 
mates of the effects of changing economic factors and the 
effects cf the variation in the number of production recrui- 
ters kill te obtained at this level than on a national 
scope. During any change in the general trends of the 
nation’s economy, local conditions are apt to vary consider- 
ably from recruiting area to recruiting area. It may be that 
the eccncsic trends in a particular recruiting district are 
in direct opposition tc those on the national level, ihile 
goals and guotas are specified at a national level, recruit- 
ment cf individuals occurs at the district level. Ihe Navy 

Secruiting Command (NRC) allocates a fraction cf the 

national gucta to each of the Navy Recruiting Areas (NRA) . 
Ihe six NRA's in turn allocate fractions of their guota to 
each Navy Recruiting District (NRD) within the area. 
Nistcrical enlistment rates and differences in regional 
population are primary determinants of guota allocation, 
although economic and demographic factors are also taken 
into consideration ££e£. 2]. Information obtained from a 
regional study, through a better understanding of the inter- 
acting processes, may be cf more practical value in allo- 
cating recruiting resources for maximum benefit, and in more 
eguitably assigning recruiting goals. 

Ihe recruiting district chosen for study is the 
Albuguergue, New Mexico recruiting district. This district 
was chosen at random from the 44 recruiting districts nation 
wide. This district encompasses the entire state of New 
Mexico, the panhandles of Texas and Oklahoma as well as 
seven counties in south western Kansas. In general this 
district experienced an increasing work force, increasing 
unemployment and a significant variation in the number of 
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producticn xecruiters during the period of study. Appendix B 
contains the time series plots of the variables. 

Background demographic factors for the area that are cf 
interest are as follcvs, Ihe average per capita income for 
the state cf New Mexico in 1979 was $ 6119, 43 rd in the 
naticr- Ihe population in 1980 was 75X White, 2 % Black and 
8 % American Indian. 31 % of the population was of Spanish 
origin, 13?i of the population was between 18 and 24 years 
old ££ef, 3j. 
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II. A BEVIJi OF IBE LIIEBITOBE ON fllllTAfiY SQg PlI 
A. CABIIHG (1979) 

In his thesis or the supply of first term enlistees to 
the United States Marine Corps, Darling, [Ref. 4], provides 
a detailed review of the literature on the supply models 
developed from the Gates Commission (November 1970) to the 
publication of his thesis (March 1979) . The following 

review will cover the pertinent literature commencing with 
Darling’s work. 

Darling used two distinct analytical technigues to 
develop models in order to forecast monthly first term 
enlistees in the Onited States Marine Corps. A multiple 
regression model was derived based on it's compatibility 
with his theory of occupational choice, the intuitive appeal 
of certain explanatory variables, the past literature of 
manpower supply and the statistical significance of each 
variables impact on monthly enlistments. The data base 

utilized was a time series of monthly enlistments spanning 
the period from July 1973 to June 1978. 

For his multiple regression model. Darling chose as the 
dependent variable; monthly observations of the ratio of 
sale nonpricr service high school graduates enlisting in the 
regular Marine Corps to male high school graduates, aged 16 
to 24, not enrolled in college (S) . Darling chose as the 
independent variables: (1) Civilian - Military pay ratio 

(CMPB), (2) a deseascnalized estimate of the national unem- 
ployment rate for persons aged 16 to 19 (UNEM) , (3) the 

monthly number of gualified leads obtained from postcards 
which were included in the national printed media advertise- 
ments (LEADS), (4) a dummy variable to correct for the 
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abDoxially large nunier of enlistments in the acnth of 
Cecemter 1976 to take advantage of the expiring GI Bill (EJ) 
and (5) the number cf Marine recruiters on duty each month 
(BFFEEC) . Bis initial hypothesis was that the number of 
monthly enlistments Is dependent on the above indepecdent 
variables in the functional form of a logistic model: 

S= n + exp [-(B^+E^CMEf 

♦ B^EJfREC *B^aMM LEADS +B^DJ +a)]^ 

Cnly these regular enlistments of male high schocl grad- 
uates in the top two mental groups were considered tc ensure 
that the observed number of monthly enlistments was not 
restricted by Marine Corps policy (demand driven) and was 
therefore at accurate representation of points on the supply 
curve. All the independent variables listed above (except 
EJ) were lagged from one tc six months and regressions were 
run tc discover those variables that significantly contrib- 
uted tc the pattern of enlistments and to ascertain the 
approximate delay in the impact on contracts signed of a 
change in the value of one of the variables. The final 
regression eguation selected was based on the intuitive 
appeal cf the regression eguation and the statistical 
significance of the results. The best fit eguation was 
determined to be: 



= -2.669 -5.647 CMPE5 +1.00 UNEM + 0.672 EJ +a 



where : 

S 

CMPE5 

ONEH 

DJ 



the enlistment rate 

civilian - military pay ratio 
lagged five months 

unemployment with no lag 

dummy variable for December/ 1976 
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a = random error. 

all independent variables were reported to be signifi- 
cant at tbe 10% level with a coefficient of determination, 
R-sguared value, of .42. Of significance is tae exclusion 
of the variable representing recruiters. Darling reported 
that when this variable was forced into the regression equa- 
tion, its' regression coefficient was never statistically 
different from zero. Darling also reported a Da rbin-Watson 
statistic of 1.08, indicating a positive serial correla- 
tion among the residuals. He felt that this was probably due 
to the lack of an attempt to explain the seasonal nature of 
Marine Corps enlistments to this point in his study. In an 
attempt to correct this problem, the Box-Jenkins methodology 
was applied. 

Darling utilized the data from the multipla regression 
model in a Box-Jenkins procedure to arrive at a final model: 

Yt= .8042511^.., SlY^.j,-?. 095902 +e^-. 47 8727eT-i2 

Where Y was the number of male high school graduates in 
mental groups I and II who enlisted in month t and e was 
the error term. Utilizing this model, he conducted two 
forecasts with varying degrees of success. The first fore- 
cast conducted incorporated the abnormally higi enlistment 
month of December 1976. This resulted in the distortion of 
the forecasts. For the second forecast, not iacorporating 
December 1976, the pattern of enlistments was well repre- 
sented by the model. 

Darling concluded his thesis by combining the two 
methods in applying Box-Jenkins to the residuals of the 
multiple regression analysis as if they were an original 
time series. He then used this combined model to forecast 
Marine enlistments with a greater degree of accuracy than 
with either model individually. 
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£. £il£ AND GILfiOY (1983) 



Dale and Gilroy ic two stodies of the economic detemi- 
nants cf military enlistment rates £Bef. 5 and 6] utilized a 
regression equation to forecast enlistment rates cf male 
high schccl graduates in mental groups I through III upper 
for the Amy. Ihe dependent variable was total contract 
signers divided ty tie relevant civilian population cf 16 to 
19 year elds. Iheir basic equations were estimated with a 
linear functional ierm using generalized least squares, 
correcting for the presence of first order autocorrelation. 
Iheir equation £cr all males was: 

A/P = -1S68.90 ♦ 16. 3S OB ♦ 7.77 UM-2 ♦ 4.59 DM-4 

♦ 40.89 W4 ♦ 313.07 Bill ♦ 408.40 VEAP 

♦ 6.26 KICK ♦ €4.65 Q3 

+ 819.73 GI ♦ 148.38 lABGEI - 0.00 BECBTB 

where : 

A = Army accessions plus OEP of male 

NPS high school grads in mental 
groups I through III upper. 

P = fievelant male population of 

16 to 19 year olds. 

UM = Current uneaployment rate of 
16 to 19 year olds. 

UM-2 = DM lagged two months. 

UM-4 = DM lagged four months. 

W4 = Batio cf first year basic military 
cempensatien to average 
weekly earnings in the private sector, 
with a four month lead. 

GI = Dummy variable = 1 in December, 1976 when 
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GI Bill €Xfired, and =0 everywhere else. 

Bill = flaxinum morthly benefit for a GI bill 

beneficiary without dependents, deflated 
by the consumer price index. Variable 
is set to 2 €Xo after December, 1976. 

VEAf = Maximum monthly benefit for a VEAP beneficiary 
without dependents , deflated by the consumer 
price index. Variable is set to zero before 
January, 1S77. 

Kici( - Maximum value of kicker payment (offered 

only tc Army enlistees who entered critical 
specialties in combat arms) deflated by 
the consumer price index. 

TAECEI= Binary variable = 1 from November, 1979 to 
August 1981 when high school grads were 
specifically targeted, and 0 elsewhere. 

Q3 = Seasonal dummy variable = 1 in July, August 
and September, and = 0 elsewhere. 

BEC£IB= Number of production recruiters. 

As with Darling's work, of some interest is the insig- 
nificant statistical results for the number of prcduction 
recruiters. Dale and Gilroy do not conclude that recruiters 
are unimportant, but that the variation in the number of 
producticn recruiters has not been large enough tc shew any 
correlation with enlistment rates. They include the variable 
”TABGIT" as a proxy fer the recruiters' efforts. 

In their study they determined that a rise in the unem- 
ployment rate of 16 to 19 year old males led to a signifi- 
cant increase in Army enlistments and that a drep in 
national unemployment from ten percent to nine percent would 
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cause Any enlistmects of male nonpcioi service high school 
graduates to fall by about 8.8 percent. They project that 
the elasticity of tctal Army enlistment rates with respect 
to relative pay is it the range of .9 to 1.7 percent for 
mental gicups I to III upper. They concluded that a rela- 
tive decrease in military pay of one percent in relation to 
civilian pay would cause enlistment rates to fall substan- 
tially. Also, they found a statistically significant 
increase in the rate of enlistments during the period 
November 1S79 to Augcst 1981 when the Army was committed to 
increasing the percentage cf high school graduates. 
Finally, they concluded that educational benefits are very 
important tc many high school graduates, including these in 
the highest mental groups. 

C. 6CI£££i€ AND GfiEEISlON (1983) 

Gcldherg and Gxeenston [Bef. 7] provided a progress 
report cn on-geing time series cross-sectional analysis 
research into the determinants of high quality enlisted 
supply. They contended that utilizing a regional data base 
rather than a national approach permited more accurate 
modeling of demographic factors and produced better esti- 
mates of the effects of changing labor market conditions. 
They asserted that by utilizing a regional approach, a 
greater capability tc optimize the allocation of goals, 
recruiters and advertising is achieved. 

A regression analysis was used to estimate the effects 
cf supply factors cn the number of contracts signed by 
nonprior service male high school graduates for each 
service. Annual data for the time period 1976T to 1982, 
using 43 Navy recruiting districts as units of observation, 
was evaluated. Seperate regression models were developed 
for mental groups I tc III upper and mental group III lower. 
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Ihey assumed the suffly of HSDG contracts in a recruiting 
district to be a leg-linear function cf supply factors in 
the form of: 



Ln B = 


1.51 + .75 lEFAY ♦ .05 RUNEM ♦ .15 LAVG 
- 0.26 VEAf + 0.43 lEOP - 0.0089 BLK 
♦ .0023 URfAN + 0.58 LNREC ♦ error term 


Bher € : 




L££iY 


= Icgarithm cf regular military compensation 
over a four year period, divided by full 
time eguivalent earnings of civilian youth. 
Military and civilian earnings are discounted 
at 30 percent. Civilian earnings are aged 
vith data cr median ueeJcly earnings of 16 
tc 19 year elds. 


R0K£B 


= Change in tie unemplcyment rate for all 
civilians. 


LAVG 


= Icgarithm cf the average unemployment rate 
fer all civilians in FY 1976-82. 


VEAf 


= Lummy variatle egual to zero in FY 1976T 
and one in fY*s 1977-82. It measures 
the net eflect of the change over from 
the GI Bill to VBAP. 


LECr 


= Icgarithm cf a district's 17 to 21 year old 
male population in thousands. 


BLR 


= percent of a district's 17 to 21 year old 
male population that is black. 


UFCJ 


= Percent of a district's 17 to 21 year old 
male population that resides in a urban area. 


LNEEC 


* Icgarithm cf Navy's production recruiters. 
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Is th€ results c£ their fisdings £oi mental groups I 
through III upper, they concluded that relative nilitary 
pay, cyclical unemployment, total population, urban mix and 
recruiters increase enlistments, while black populaticc and 
the less of the GI Bill cause enlistments to decline. They 
determined that a one percent increase in relative military 
pay would cause an increase of Navy enlistments cf 0.75 
percent and that a one percent increase in the unemplcyment 
rate would increase enlistments by between 4.3 and 5.1 
percent. Although they indicated that their results 
utili2ing regional unemployment were inconsistant, it 
appeared that ir general a rise In regional unemplcyment 
caused a rise in enlistments in the region. 

They feund that the loss of the Gl Bill caused a decline 
in the supply of target candidates of 26 percent tc the 
Navy. They additionally determined districts having greater 
population tended to have more recruiters resulting in their 
separate effects being difficult to measure. They did deter- 
mine that a doubling cf recruiters and population results in 
slightly more than a doubling of enlistments, and that a cne 
percent rise in the urban population increased enlistments 
by one tc two tenths cf a percent. Overall, thjey concluded 
that the research approach using annual time series, cross- 
secticnal data was fruitful and that additional research 
will enhance the usefulness of this model for forecasting 
and policy analysis. 
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III. AIILICAT ICN CF BOI-JE MK INS 



I. AHAIISIS SlEf OHE 

Ike Box-JenJcius methodology was applied to a mccthly 
time series of norprior service high school graduate males 
signirg contracts in the Alhuguergue, New Mexico recruiting 
district during the period October 1978 to Septemter 1S83 
(60 data pcints) . Ihe data was limited to those contract 
signers ir mental groups I through III upper and represented 
net ccntracts signed, that is new contracts signed less 
Delayed Entry Program (DIP) attrites. Figure 3.1 is a time 
series plct of this data. This plot suggests strong season- 
ality in the number cl high quality male enlistes with peak 
months appearing to he in late summer, predominately August, 
and in January. An "average" year was constructed by aver- 
aging tie individual months over the five year period. 
Figure 3.2 is the time series plct of this average year. Ihe 
average lor each month is denoted by the letter "A". Also 
depicted in the graph is the average of the contracts signed 
in the first three years of the data series denoted by "1" 
and the average of the last two years denoted by "2". Ihe 
average cf all the years ,"A", tends to support the assump- 
tion cf seasonality suggested in the time seris plot cl the 
data set. however, in the plot of the last two years alone, 
the suggestion for seasonality is not as clear which would 
tend to indicate that some process change has taken place. 

Cl seme interest is the apparent sudden increase in the 
mean number of contract signers over the approximate last 
third cf the series. Further analysis indicates that for 
the time series as a whole the mean number of ccntracts 
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Figure 3.1 Honthly Contracts Signed. 
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signed per month was 49.07. For the approximate first two 
thirds of the series the mean was 42.83 and for the approxi- 
mate last third the mean was 62. 53. Because of this unusual 
pattern^ two additional time series plots were constructed 
for this data. The first was a time series pLot for the 
first forty one data points (October 1978 to February 1982), 
figure 3.3, and the second was for the last nineteen data 
points (March 1982 to September 1983), figure 3.4. Although 
no obvious trend is apparent in either data subset, 
both figures do provide some indication of seasonality. 
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•Ih€ next Stef in deveiofing the models for high guaiity 
recruitment was to flot the autocorrelations for the data 
sets. figure 3.5 is the autcccrreiation plot developed by 
the Binitat general purpose statistical computing system for 
the period October 1S78 to September 1983. An examination of 
these autocorrelations indicate that non-stationary behavior 
in the process level (a trend) is evident in the the data. 
Ihe evidence of such behavior is indicated by having auto- 
correlations which dc not die out rapidly to zero. Although 
the autocorrelations for this data indicates a general trend 
in the data, the time series plots and the mean difference 
in the two data subsets tend to suggest not an overall 
trend, hut a sudden and sustained overall increase in the 
number of contracts signed by the target cohort. Ihis say 
he caused by some yet to be determined external influence. 
An evaluation of this sudden and sustained increase in 
contract signers will be addressed in Chapter Four. 

Iwc separate courses of analysis were pursued. First, in 
the event that the cause of the sudden increase in the mean 
could not he identified and Quantified, a model which was 
adequate to describe the data set as a whole was sought. 
Second, in the event that the cause could be identified, 
its* affect quantified and any future occurrence predicted, 
a single mcdel which satisfactorily described the underlying 
data patterns of each subset around their mean was sought. 
If the cause of this sudden change could be determined or 
controlled, then the second mcdel could possibly be used to 
more acccrately forecast future contract signers. 
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Eigare 3.3 Monthly Contracts (Oct 78 - Feh 82). 

£. AQIGCOSfiELATlOHS iHO PABTIAL AOTOCOfifiEIATlONS EXAMINED 
Cct cber J928 to Sep temher 198 3 

As €vid6nced by figure 3.5^ there is a suggested 

trend for this data set due to the increase in the mean 
number of contracts signed. As the Bor-JenJcins methodclogy 
requires stationary data to be effective, differencing i was 
used. Ibis data set was differenced once and the resulting 
autocorrelations and partial autocorrelations, as repre- 
sented ir figures 3.6 and 3.7, were examined. 



lEifferencing is accomplished by subtracting tie first 
value from the second, the second from the third, the third 
from the fourth, etc. Differencing is the most rapid method 
of reacving a trend from a time series [Bef. 8]. 

Erff erencing may he performed once or more in order to 
remove the trend, however differencing once is usually 
adequate . 
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Figure 3.4 Boutlil; Contracts (Bar 1982 tc Sep 1983). 

Ibree statistically significant data points are 
observed in figure 3.6: the first, the twelfth and the thir- 
teenth. Cverall, this plot suggests that the trend has been 
removed and that only horizontal, stationary, data remains. 
Ihe strong positive autocorrelation at the twelfth period 
supports the seasonality suggested by figures 3.1 and 3.2 

Cctcber 192€ to February 12^ and Mar ch J982 to 

September 

Figures 3.8 through 3.11 are the autocorrelations 
and partial autocorrelations for the periods October 1S78 to 
February 1962 and March 1982 tc September 1983 respectively. 
These plots suggest that both subsections of the data set 
are trendless, with a suggestion of seasonality evident only 
in the October 1978 tc February 1982 data. Identification 
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Figure 3.5 AUIOCOfifiELATIONS. 
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figure 3.6 AOICCOBfiSIillCMS OF DlFFEfiENCED DAIA. 
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Figure 3.8 ACIOCOBBEIAIICMS (OCT 78 - FEB 82). 



and estiiation of tie seasonal nature of these two subsets 
is ver^ difficult because of the brevity of the time series. 
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figure 3.10 AGIOCOBBEIAIIONS (MAfi 82 > SEP 83). 
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Figure 2.11 PiRTIlI AOTOCOfiBEIATlOIiS (HAB 82 - SEf 83). 

C. flCIfl OEVELOEHENl 

Ic ccofaring the autocorrelations and partial autoccrre- 
laticDS 1o the various ASMA models in figures A. 5 through 
A. 13, a moving average model is suggested for the data set 
for October 1978 to September 1983 as well as for each 
subset of the data, ficwever, which particular MA model would 
best describe the underlying data pattern in each case is 
unclear. 

1. Oc tober J978 to Sep tember 1983 

laking into consideration the apparent seasonality 
of the data set, the apparent trend in the data and the 
uncertaicty as to which specific HA model would be most 
appropriate , two models were developed for evaluation based 
on a visual inspection of the autocorrelations and partial 
autcccrielations. 



a. first Order, Seasonal, Moving Average Model witi the 
forr: 



= A^= 1 1' ©.B) ( 



(egn 3.1) 



t. Second Order, Seasonal, Moving Average Model with 
the form: 



= n-0,B-0jB2) (1-0,^Bi2)e, (egn 3,2) 



iihere e is the error term. 

2- Cctcber J978 tc Feb ruary 1982 

In developing the models to be tested for this data 
subset, the absence of an apparent trend, the suggested 
seasonality, and the uncertainty as to which particular 
loving average model would best describe the underlying data 
patterns were taken into consideration. Two models were 
selected for evaluaticn based on a visual inspection cf the 
autoccrrelations and partial autocorrelations. 

a. first Order, Seasonal, Moving Average Model with the 
fern; 



V = (1-e.B) (1-0,^Bi2)e^ (egn 3,3) 

h. Secend Order, Seasonal, Moving Average Model with 
the form: 
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(l-e,B-0^E2) (1- 0,^E^2)e, 



(eqn 3.4) 



i 



Kher€ e is the error term. 

- • Septe mber 1 983 

Id developirg the models to he tested for this data 
subset, the absence of an apparent trend, insufficient 
length in the time series to exhibit seasonality, and the 
uncertainty as to which particular moving average lodel 
would best describe the data patterns were taken into 
consideraticn. Twc models were selected for evaluation 
based on a visual inspection of the autocorrelations and 
partial autccorrelat icns. 

a. First Order ecving Average Model with the form: 



ltT= (1- 0,B)er 



(egn 3.5) 



b. Second Order boving Average Model with the form: 



'ir =(1- 0,f- Qp)Qr 



(egn 3.6) 



Rhere e is the error term. 

C. HCCII EVALOIIIOH 

Ihe models developed were evaluated utilizing the 
Hinitab general purpose statistical computing system command 
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"ABIiJi" (Autoregressive Integrated Moving Average). 2 in 
crder for a model to be considered adequate the residuals 
(errors) of the estimates generated by the model would be 
required to be randcr (unccrrelated) with a mean of zero. 
If each of the models developed for each specifio data set 
satisfied this requirement, the sum of squared errors and 
the I-ratics for each model would be examined to determine 
the most appropriate model. The results of this evaluation 
are contained in Table I. 

i. BISQIIS OF TfiE HCIEI ETAIOlIlOH 

1 . Cotcbe r J9 7 8 to Sep tem b er 1983 

As evidenced by table 1, both models corsicered 
satisfied the requirement of random residuals. There exists 
little difference between the two models to suggest which is 
the sere appropriate, although confidence in the second 
model is reduced by the low I-ratio for the MA(2) operator. 
In this case the principle of parsimony^ was invoked and the 
selected model was the MA(1) with the form: 



Y^ = (1-.6«4B) ( 1+. 448B12) Ot 



(eqn 3.7) 



(.101) (.ICi) std. error 



2The concepts and notations used by this com mand. fellow 
the ones developed by Box and Jenkins. The values of (p and Q 
are determined utilizing a nonlinear least squares algorithm 
developed by Marguart. The ABIMA command fits non-seasonal 
and seascnal models to a time series with input consisting 
of a stored time series and information as to the degree or 
the AB, BA or ARMA model to be fitted £Bef. 9]. 

3The principle of parsimony cautions to attempt to use 
the smallest possible number or parameters which represent 
the mcdel adequately £Bef. 10]. 
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2 . Cct cbe r 1921 iS Feb r uary 1982 

Ihe two models selected lor this data subset were 
run with results displayed in Tab>le I. The residuals for 
both models met the randomness requirement, however the low 
1-ratics fcr the noi-seasonal coefficients in both models 
was less than encouraging. As a result of these low 
T-ratios, another model, SHA(1)# was attempted. This model 
has the form: 



Yt = 0 ♦ (1- 0.^E^2) 



(egn 3.8) 



An ezaminaticr of Table I for this data set indi- 
cates little to distinguish one model from another in terms 
cf mere adequately describing the data patterns. Again the 
rule of parsimony was used. The model selected was the 
8UA(1) mcdel with the form: 

Y = h2.h€ ♦ { U.776B12) e^ (egn 3.9) 

(1.78) (.2C3) std. error 

3. garch 1.9^ tc S epte mber 1983 

Here, again, each model selected for evaluation 
satisfied the requirement fcr random residuals. The signifi- 
cant difference between these two models is in the residual 
sum cf squared errors, where the MA(2) model is clearly 
superior to the MA(1) model. Also encouraging is the signif- 
icant T-ratio exhibited by the HA (2) coefficient in the 
second mcdel. The mcdel selected for forecasting purposes 
was: 
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y = 62.S5 + ( 1-.25CB-.745B2)eT 

(0.23) (.2C6) (.208) std. error 



(egn 3. 10) 



I. fCBiCASTlNG NIIH IHE S£I£CI£0 MODELS 

Iatl€ II and figures 3.12 through 3.15 represent the 
results cf the forecasts of the selected models. The actual 
values fcr the net nuster of target cohort contracts signed 
and tie forecasted values are displayed within each table 
secticn. Ihe 95 percent limits displayed indicate that cne 
should he certain that the actual value will fall between 
these twc values 95 percent of the time. In figures 3.12 
through 3.15, the forecast results are displayed graphi- 
cally. Ihe actual net contracts signed are dipicted by the 
letter ”1”, the forecasted values by the letter '*£" and the 
upper and lower 95 percent confidence limits by the letters 
"U” and "L" respectively. 

lable XI is divided into four sections. Ihe first 
secticn, ccrresponding to figure 3.12, is a forecast fcr sir 
periods ahead fcr the period October 1983 to March 1S84 
based on the October 1978 to September 1983 time series 
utili 2 ing the seasonal MA(1)D model previously selected for 
this time series. Kith the exception of the March 1S84 
forecast, all actual values fall within the 95 percent 
confidence limits, however, these confidence limits are too 
vide to be cf much practical value. As evidenced by figure 
3.12, the model dees generally capture the pattern of 
enlistments for the forecasted period, although the degree 
of the decrease for March 1984 is not well represented. Ihe 
variaticn in the forecasting errors, ranging from a lew of 
♦5.0 percent to a high of +67.4 percent are not particularly 
encouraging. 
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Id the second section, corresponding to figure 3.13, the 
forecasts are conducted on a oonth to month basis utilizing 
the seguential* forecasting method. For example, the initial 
forecasts fcr this data set were conducted utilizing 60 data 
points. Hhen the 6 1 st data point became available, the 
parameters cf the same model specification were reestimated 
using 61 data points and a forecast for period 62 was 
conducted. Forecasts were conducted in this manner for the 
period Cctober 1983 to March 1984. As evidenced by section 
Two of the table, forecasting with this method produces 
superior results to forecasting multiple periods ahead. For 
the scst part, forecasted values well represent the actual 
values, although here again the model fails to adeguately 
represent tie sudden decrease for March 1984. 

In sections three and four of Table II, corresponding to 
figures 3.14 and 3.15 respectfully, the forecasts cf the 
HA(2) model developed for the March 1982 to September 1S83 
time series are examined. Section three, as in section one 
cf the table, is a forecast for six periods ahead. The 
obvicQS problem of attempting to forecast future periods 
with a ncn-dif f erenced MA model is the models tendency to 
forecast the mean cf the time series after only a few 
periods, precluding any meaningful evaluation of these fore- 
casts. Observing the first two forecasts for the period 
October 1983 to March 1984, the results are not very encour- 
aging. The errors fcr these two periods are much too large 
to be cf any practical application. 

In section four the forecasts were conducted with the 
sequential forecasting method utilizing the March 1982 to 
September 1983 time series. With only one forecasted value 



*As soon as the value of the next data point becomes 
available, all forecasts for future periods are updated 
utilizing reestimated parameter values of the AfiMA model, as 
opposed to the adaptive method in which the ABMA model 
parameters remain u Ic'Eanged . 
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falling within ten percent cf the actual value, and with the 
forecasts heing condccted onlj cne period ahead, the aodel 
would appear to have extremely limited usefulness. 

Cf tie two models developed, the Seasonal model 
developed for the October 1978 to September 1983 time series 
would seem to be the most useful in forecasting future net 
values of high quality recruits. However, as evidenced by 
lable II and Figures 3.12 and 3.13, sudden major swings in 
the values cf contracts signed are not adequately predicted 
by this model. In Chapter Four a causal model utilizing 
regression analysis will be examined for applicability, and 
to determine if sudden shifts in the dependent variable will 
be adequately described by a causal model. 
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TIBII I 

HOIil EV&IOAIION BESOLIS 



Octotjr JS2§ to September 1983 

MA( 1) D 


MA (2) D 




Besiduals: 


Ban dom 


BANDOM 




Mean: 


♦ 0. 80 


♦ 1.03 




Std Cev: 


8. 89 


8.66 




Saqiul acance Level 


for Hear: 


2. 32 


2.26 




fiesicual 


Sum of Sguares: 


461 9.61 


4416.89 




T-Eatios; 


UA <1} 


+6, 39 


+ 3. 46 




MA (2 


NA 


♦ 1.72 




SHA(1) -2.76 

October JS2i to febrcar^ li82 


-3.01 






MA ( 1) 


MA (2} 


S MA (1) 


Besiduals: 


Ban dom 


fiandom 


Bacdom 


Mear : 


+0. 76 


♦ 0.73 


♦ 0.75 


Std Dev: 


6. 27 


6.35 


6.37 


SiQDizicatce Level 


for Bear: 


1. 96 


1.98 


2.00 


Besicual 


Sum of Squares: 


1594.65 


1634.77 


1646.43 


I-Batios: 


MA ( 1) 


-1. 05 


-0.91 


NA 


MA 12 ) 


NA 


-0.03 


NA 


SMA ( 1) 


-3. 52 


-3.73 


-3.83 


CoDstant: 


42. 37 


42.43 


42. 4€ 


Actual meaii for times series: 42. 
March J182 ^ Sep teater 1983 
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MA(1) MA(2) 



Besiduals: 

Mear : 

Si* d Ep y • 

Sigoificacce Level 
for Bear: 


Ban dom 
-0. 02 
8. 48 

3. 89 


Bandom 
♦ 0.42 
7. 13 

3.27 


Besicual 
Sum of Squares: 


1294.39 


918.04 


T-Ha tios : 
MA (1) 

MA (2) 


♦0. 54 
NA 


♦ 1.20 
♦ 3.59 


Constant: 


62. 47 


62.95 


Actual aean for the 


time series: 


62.53 
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TIBIE 11 
FOBECASl BESOLTS 

OCICEEB 1S78 TC SEE1EMBEB 1963 DATA SET FORECAS TS 
FCBECAEIING SIX PEBIODS ABEAE 



95 PERCENT LIMITS 



MCMIfl 


FORECAST 


LOWER 


UPPER 


ACTUAL 


51 EEBOR 


OCT 63 


65.7 


48. 1 


83.4 


53 


+ 24.0 


NOV 63 


64.4 


45. 6 


83. 1 


52 


+ 23.8 


DEC 83 


65.8 


46. 0 


85.6 


59 


+ 11.5 


JAN 84 


63.4 


42. 7 


84. 1 


59 


+ C7.5 


FEE 84 


58.8 


37. 2 


80.5 


56 


+ 05.0 


MAE 84 


56.9 


34. 3 


79. 4 


34 


+ 67.4 


secdentiai 


FORECASTING METHOD 












95 PERCENT LIMITS 






MONTH 


FORECAST 


LOWER 


OPPEB 


ACTUAL 


% ERROR 


OCT 83 


65.7 


48. 1 


83.4 


53 


+ 24.0 


NOV 83 


59.8 


41. 9 


77.6 


52 


+ 15-0 


DEC 83 


58.5 


40. 8 


76.3 


59 


-CO. 8 


JAN 84 


56.6 


38. 9 


74.2 


59 


-04.1 


FEB 84 


53.6 


36. 1 


71. 1 


56 


-04.3 


MAB 84 


53.0 


35. 6 


70.4 


34 


+ 55.9 



MAECH 1982 TO SEPTIHEER 1983 DATA SET FORECASTS 



FOBICASTING SIX PEEIODS AHEAD 



95 PEECENT llfllTS 



MONTH 


FORECAST 


LOWER 


UPPER 


ACTUAL 


% ERROR 


OCT 83 


69.2 


54. 3 


84. 0 


53 


+ 30.6 


NOV 83 


58,8 


43. 5 


74. 1 


52 


+ 13.1 


DEC 83 


63.0 


44. 1 


81.8 


59 


+ C6.8 


JAN 84 


63.0 


44. 1 


81.8 


59 


+ 06.8 


FEE 84 


63.0 


44. 1 


81.8 


56 


+ 12.5 


MAE 84 


63.0 


44. 1 


81.8 


34 


+ 85.3 


SECDINTIAI 


FORECASTING METHOD 












95 PERCENT LIMITS 






MONTH 


FORECAST 


LOW EE 


UPPER 


ACTUAL 


% ERROR 


OCT 83 


69.2 


54. 3 


84.0 


53 


+ 30.6 


NOV 83 


61.3 


44. 2 


78. 5 


52 


+ 17.9 


DEC 83 


67.5 


50. 5 


84.5 


59 


+ 14.4 


JAN 84 


63.1 


46. 2 


79.9 


59 


+ 06,9 


FEE 84 


62.6 


46. 1 


79. 1 


56 


+ 11.8 


MAE 84 


61 .5 


45. 2 


77.8 


34 


+ 60.8 
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IV. THE BEGBESSIOH flODEX 



1. E151C ££SOaPIIOH£ 

It was assumed ttat the forecasts generated with the 
model presented in Chapter 3 could be improved u^cn by 
developing a model which took into consideration causal 
relationships. Ihe factors assumed to most actively influ- 
ence tie net number of high guality nonprior service males 
opting to join the liavy was conjectured to be: number of 

production recruiters, some measure of unemployment, size of 
the work force, military/civilian pay ratio and recruiting 
goals . 

Ihe utilization of the regression model reguires that 
four basic assumfticrs be made. First, that the dependent 
variable, net contracts signed, is linearly related to each 
of the independent variables. If the relationships are not 
linear, then regression analysis cannot accurately be 
applied to the probleo. Second, the variance of the resi- 
duals (errors) of the regression equation remains constant 
over the time series. Third, that the residuals are indepen- 
dent of one another, that is any given residual should not 
be a function of the error terms preceeding or following it 
in the time series. The existence of such a relationship, 
called serial (auto) correlation, implies that either an 
important independent variable has been omitted or the wrong 
functional form has been used in the regression equation. 
Generally for a data set of the size used in this study, a 
Curbir-iatson statistic value between 1.5 and 2.5 implies a 
lack of serial correlation. The final basic assumption is 
that the residuals are normally distributed. 






£. VIGIi£I£5 C0MSIC£££0 FOB IBCIOSIOB 
Con tracts Sl qne d ; CNTECTS 

As previouslj defined, the dependent variatle is the 
total numter of contracts signed per month by ncnprior 
service high school graduate males in mental groups I 
through III upper nithin the Albuguergue, New Mexico 
Eecruiting Eistrict less the numter of Delayed Entry Program 
(DEP) attrites of the same cohort during the same xonth. 
This cata was obtained from Navy Recruiting Command for the 
period October 1S78 tc September 1983. 

2. Ihe Number of Producticn Recruit ers; RCTR S 

Ihe values for this variable are the number of Navy 
Recruiters who were actually on duty in Navy Recruiting 
District Albuguergue during the period of the time series. 
They represent actual canvassers and exclude staff 
personnel. This data was obtained from Navy Recruiting 
Command. It would seem to be logical to expect an increase 
in the nurber of production recruiters to result in an 
increase in the number of contracts signed, subject to 
effects cf marginal jrcductivity. 

3. N umbers cf D remploy ed; ON EMPL D 

Ihe values fcr this variable represent the numbers 
cf people determined to be unemployed by the 0. S. 
Department of Labor. The values of this variable were 
divided by 10,000 so that the data could be manipulated mere 
easily. These values were divided by 10,000 so that the 
data could be manipulated more easily. These values were 
built up from a county level and represent the seasonally 
adjusted number of unemployed within the target recruiting 
district as a whole. No breakdown by sex or age group was 
available. This data was obtained from Navy Recruiting 
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Commard. It was assuited that an increase or decrease in the 
numbers of uneiplcjed would result in a corresponding 
increase or decrease in the number of contracts signed. 

4. num bers in the Work Force: WKFHC E 

Ihe values fcr this variable represent the numbers 
in the wcrk force, employed and unemployed, as determined by 
the 0.5. department cf labor. The values of this variable 
were built up frcm a county level and represent the season- 
ally adjcsted work force in the target recruiting district 
in total. This data was obtained from Navy Recruiting 
Command. As the values for this variable increased or 
decreased, it was expected that the numbers of contracts 
signed would increase or decrease correspondingly. 

5- M ilitary /Civi lian Pajr Batio: PAIRATIO 

The merits cf any constructed military / civilian 
pay ratio could be argued indefinitely with no concrete 
results as to a most meaningful measure. For the purposes 
of this study, the military component of the ratio was 
calculated as monthly base pay for an E-1 with mere than 
three months service. The civilian component utilized was 
average weekly earnings of production workers in the state 
cf New Mexico multiplied by four. The average weekly earn- 
ings figure was obtained frcm "Employment and Earnings" 
published by the 0.5. Department of Labor; Bureau cf labor 
Statistics. Intuitively, an increase in the military/ 
civilian pay ratio wculd result in an increase in contracts 
signed. 

6 . Pe r cent Onemrl oved ; PEBCONE 

The values of this variable are UMEMPLD/WKFRCE. This 
variable represents the percentage unemployed in the target 
recruiting district. This variable was included in the study 
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to d€t€rniD€ if the percentage of unemployed within the work 
force acre accurately predicted the number of contracts 
signed than the crdiral value cf unemployed. 

7- Chan ge in On employment; DELTAUN 

Ihe Values for this variable represent the month to aonth 
changes (+/~) of the variable PEBCUNE for the target 
recruiting district. This variable was included in the 
study to determine if changes in the unemployment rate could 
signal corresponding changes in the number of contracts 
signed. 

8* 8n employment Batio ; UNE MRAI 

Ihe values of this variable represent a ratio of the change 
in uremplcyment frci month to month within the target 
recruiting district as expressed by the following formula; 

ONEtPlD in month T - UNEMPLD in month I- 1 

DKEMBAI = 

ONEMPLD in month T-1 

It was hoped that this variable could capture the effects of 
both the percentage unemployed and the change in unemplcy* 
sent cn a monthly basis. 

9 • Goals - Cuo tas 

Unfortunately, there was insufficient data cn goals 
for the target cohort to examine goals for inclusion in the 
regression equation. Intuitively, goaling would seem to play 
a key role in determining the number of contracts signed. A 
recruiting district which had achieved target quotas for the 
month wocld be tempted to delay additional accessions until 
the fcllcwing month to help insure the attainment cf goals 
cn a continuing basis. Additionally, goals would seem to 
mask the supply function of recruiting making the study of 
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the suffly protlem much mere difficult. ^ it is the 
hypothesis of this author that a detailed study of the 
effects of goals on enlistment contracts signed would indi- 
cate a very high positive relationship bewteen these two 
variables. Ihis is ar area which demands increased attention 
and studj. 

C. !I££ EEGBESSIC8 flCIEL DEVEICFED 

In determining the final choice for the explanatory 
variables, it was recessary to determine the lag or lead 
effect, if any, that was exerted on the dependent variable 
by each independent variable. For example, an increase in 
the number of production recruiters may not affect the 
number of contracts signed for several months, assuming that 
there is a learning curve associated with the pcsiticn. 
Similarly, an increase in the unemployment rate may not 
affect erlistments fer several months. All of the indepen- 
dent variables previously described were given a lag and 
lead ci from one to six months and simple regressions were 
conducted tc determine which increment of lag or lead best 
explained the variation in the number of contracts signed. A 
variable was allowed to enter the equation only once. That 
is if, fer example, PERCDKE evidenced a significant influ- 
ence cn CMTFCTS lagged one month and lagged two months, orly 
the lag with the most significant coefficient of determina- 
tion, E-sguared, was included in the regression equation.* 



*£iegel and Eorack [fief. 2] provide an excellent dicus- 
sion ci the interaction between goals and the recruiting 
effort. 

«lbe decision to utilize a single lag or lead for each 
variable was made prior to conducting the initial regres- 
sion. However, during the the analysis, all variable lags 
and leads with a T-ratio greater than 2.00 for a simple 
regression were fitted into a multiple regression. The coef- 
ficient cf determination of this multiple regression was not 
significantly different frem the coefficient of determina- 
tion for the multiple regression utilizing a single test fit 
lag cr lead for each variable. 
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A lEuJtifle regression model utilizing the lag or lead of 
each variable with the highest coefficient of determination 
was constructed, and a regression conducted on the dependent 
variable net contracts signed with results as follows: 

Y= -42.6 ♦ .405 fiClESI + 6.0 UNEMPLDl - 1.0 HKFBCEl 

♦ 256. C PAYBAT04 - 241.0 PEBCUNE2 ♦ 28.2 DELIAUK 

♦ 16.8 0NEMBAI2 ♦ a (egn 4.1) 



where : 



BCIBS1 

UNEEPIC1 

WKIBCE1 

PAYBAIC4 

PEEC0NE2 

OEIIAON 

UNEEBAI2 

a 



number of production recruiters lagged 1 month 

unemployment lagged 1 month 

numbers in the work force lagged 1 month 

military / civilian payratio lagged 4 months 

percent unemployed lagged 2 months 

change in unemployment with no lag 

unemployment ratio lagged 2 months 

residual 







ST. DEV. 


I-BAIIO= 


COlDilN 


COEFFICIENT 


OF COEF. 


COEE/S.t 


COKSIAKT 


-42.6 


158.9 


-0.27 


HClfSl 


0.4051 


0.2668 


1.52 


0NEKPLC1 


5.S7 


17.81 


0.33 


HKFBCEl 


-0.51 


1. 187 


-0.43 


PAYEAIC4 


256.17 


96. 18 


2.66 


PEBCUNE2 


-241 


2498 


-0. 10 


DEIIAON 


28.23 


10.26 


2.75 


UNEKBA'12 


16.81 


11.52 


1.46 


IHE SI. LEI 


. OF I AECCI THE 


BEGBESSION LINE IS 




£= 8.030 








B-SCOABEC = 


62.2 PEBCENI 






E-SCOiEEI = 


56.7 PEBCENI, AEJOSTED FOB D.F. 
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ANALYSIS CF VABIANCE 



DUE TO 


DF 


SS 


aS=SS/DF 


EEGBESSICN 


7 


5097.30 


728. 19 


EESIECAL 


48 


3094.84 


64.48 


TCTAI 


55 


6192.14 





Those variables kith negative T-ratios were excluded 
irois consideration in the iinal regression model as having 
counter intuitive results. The regression was conducted a 
second time utilizing the variables: fiCTBSI, (JNEhfLCI, 

FAYBAT04, LEITAUN, and UNEHBAT2. The results o£ this seccnd 
regressicn were exasined and those variables having a 
T-ratio^ less than 2.00 were eliminated from consideration 
in the final model. A third multiple regression was then 
conducted with the following results: 

Y=-6 1.8 + 4. ONEMPLDl + ig:*. PAYBATC4+19. 8DELTAUN+a (egn 4.2) 
where : 

0NEEPIL1 = uneiplcyaent lagged 1 month 
PAYEAIC4 = militar j/civilian payratio lagged 4 months 
DEIIAUK = change in unemployment with no lag 
a = residual 







ST. DEV. 


T-BATIC 


CCLDHN 


COEEFICIENT 


OF COEF. 


COEF/S.D 


CCKSTANT 


-61.84 


35. 11 


-1.76 


UNEMFLD 1 


3.670 


0.545 


6. 61 


PAYBAT04 


193.26 


90. 35 


2. 14 


CEITAON 


19.785 


9.410 


2. 10 



THE S3. LEY. OF Y ABCCI BEGBESSION LINE IS 
S = 8.112 



^A high T-ratio indicates that the independent variable 
is impcrtant in explaining the value of Y. The higher the 
T-ratic, the more unlUcely that the b coefficient is a 
randcs variation froi zero. 
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E-SCOiBEE = 58.2 PEBCENI 

£-SCD1B£I = 55.8 PEBCENI, AEJDSTED FOB D.F. 
ANAIISIS OF VARIANCE 



DUE IC 


DF 


SS 


HS=SS/DF 


BEGBESSICN 


3 


h770.27 


1590. C9 


EESIECAl 


52 


3<<21.88 


65.81 


lOTAI 


55 


8192.15 





Ihe signs of the coefficients of the explanatory vari- 
ables aie what one wculd expect intuitively. As unemplcyment 
increases, so does the number of contracts signed. An 
increase in the military/civilian pay ratio will cause an 
increase in the number of contracts signed, and as the 
percentage unemployment changes, the numbers of contracts 
signed change accordingly. 

Gne cf the mere surprising results of the regression was 
the exclusicn of the variable ECIfiS, representing the number 
cf prcducticn recruiters. nhen forced into the regression 
equation, the coefficient for this variable was never 
significantly differed from zero. Experience and intuition 
would indicate that recruiters play an important role in the 
numbers cf high quality nonprlor service males enlisting in 
the Navy. 

Ihere may be several explanations for the results cf the 
regression in regards to recruiters. In examining the time 
series plots for CMTECI5 and BCIB5 contained in Appendix B, 
it is evident that ever the approximate last half cf the 
period the numbers cf contracts signed is increasing as the 
number cf production recruiters is decreasing. It is 
possible that as the Navy gains more experience in the 
selection cf recruiters, better recruiters are placed with 
the recrclting districts, resulting in higher individual 
recruiter production. Also, it is possible that the various 
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services compete with each ether for the target cchcrt with 
approiiaat ely egual determira tion thus cancelirg the 

apparent overall influence cf the recruiters. It is also 
possible that the target cohort is not as easily influenced 
ly recruiters as "Icwer guality" recruits, as the target 
cohort wculd seen tc have a greater selection of civilian 
occupations. The individuals in the target cohort may have 

already made the decision to enlist based on eccncmic 
factors prior to contact with a production recruiter, and, 
it is pcssihle that recruiters really don't have a signifi- 
cant impact when compared to the impact of unemployzert and 
comp ecsaticn. 

The studies discussed in Chapter Two found little 
significance between the numbers of production recruiters 
and the number of enlistments. Perhaps the services in 
general and the Navy in particular have passed the point of 
decreasing returns as far as recruiters are concerned. There 
may be sene optimal level for the number of production 
recruiters such that additional production recruiters over 
this level are not beneficial in enlisting additional 
members cf the target cohort. It may be more cost effective 
to invest in advertising than in additional recruiters. The 
area cf costs and benefits of recruiters versus advertising 
is an area which deserves additional study. 

The emclusicn of the variables WBKFBCE, PEBCONE and 
DNEHEiT is not as discouraging. As evidenced in the regres- 
sion models presented, when these variables where included, 
there was an adverse impact on the T-ratio for the variable 
QNEBPID1. Since the values of the variable UNEHPID are 
included in the variable BBRfBCE, and the values cf the 
variables PEBCONE and ONEdBAT are constructed from the vari- 
able ONEBPIB, multicclinea rity among these variables may be 
a factor, and is suggested by the regression results. The 
effects of unemployment and the change in unemplcyment 
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appear well represeited ir the fiaai regression equation 
fcithcut including these three additional variables. 

£. ££51£011S EZAMIBli;. 

Ite Eurtin-Watson statistic for the regression was 1.52. 
As previcusly stated, a Durbin-Batson statistic between 1.5 
and 2.5 normally indicates a lack of serial (auto) correla- 
tion for a cata set cf the size used in this study. With 
three independent variables and a sample size of 60, a 
Eurbic-Watscn statistic between 1.69 and 2.31 would assure 
no serial correlation in the residuals. The obtained 
statistic cf 1.52 falls in the inconclusive range. Since 
this test is inccnclusive, additional study is warranted. 

The standardized residuals and the raw residuals were 
plotted in figures 4.1 and 4.2 The value of a standardized 
residual was determined by dividing the residual by its 
estimated standard deviation. Of particular interest here 
is ary value outside ♦/” 2. 00. In Figure 4. 1 there are only 
two values that lie cutside the specified limits. Since the 
cverwheliing number of residuals lie well within these 
limits, nc significant bias is indicated. In Figure 4.2, the 
residuals are plotted. The residual mean is -0.00058120 
with a standard deviation of 7.39. Framing Figure 4.2, there 
is nc cbviocs suggestion of a pattern in the data. Bowever, 
the Eurtin- Watson statistic of the final multiple regres- 
sion mcdel requires that further analysis be conducted to 
determine if serial correlation erists. In the fcllcwing 
chapter, the residuals of the regression will be examined 
further. Combining the regression model developed in this 
chapter with an ARIBl process as described in chapter 3 
would provide an impicved model, assuming serial correlation 
in the residuals. If no serial correlation exists, then the 
regressicn model alone would prove to be the superior mcdel. 
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V. IHE CCHBINED ^DEl 



1. CCEEIIEI HODEL OiVELOPES 

The iLuItipl€ reciessioo aodel developed Id Chapter 4 
ratislied all requirements except for the possitilitj of 
serial ccrrelaticn ir the residuals. In order to determine 
if in fact a serial correlation problem existed and, if so, 
to attempt to correct this problem, the residuals uere exam- 
ined further. The hypothesis was that if an AfilEA model 
could be developed fci the residuals, then a serial cocrela- 
tion problem existed. However, this same model could in turn 
be applied as a correction to the regression model for more 
accurate forecasts. 
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figure 5.1 BESIOOAl AOIOCOBBELATIOHS. 
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Ifc€ ttultiple regression model residuals are displayed in 
figure 4.2 Ihe aut cccirelat ions are plotted in Figure 5.1 . 
In tiis figure only the autocorrelation at lag 13 falls 
outside V 2 standard errors. However, no physical inter- 
pretation, such as seasonality, can be attributed to this 
spike, and it can be interperted as not being significant. 
Since tiere is no clear indication of a nonstaticnary 
pattern in the data, a white noise model was indicated as 
most appropriate. This model has the form: 

= Sij (egn 5.1) 



where: 

Zy. ~ the residual value at time t 
a^ = a random shock at time t 

This indicated that a model could not be developed for 
the residuals and, when the attempt was made to fit various 
AfihA models to the data, no AflUA model could be found that 
was appropriate for the non differenced residual time series. 
This strongly suggests that no serial correlation existed 
for the residuals of the regression model developed in the 
previous chapter. 

Eeexamining figure 5.1, there is a sligh t indication of 
a trend in the data if the general shape of the plot between 
lags 1 and 13 are observed. In order to explore this 
further, the residual time series was differenced once, with 
the resulting autocorrelations plotted in Figure 5.2 
Examining this figure, the immediate impression is that the 
data has been over differenced. This is suggested by the 
large negative spike at lag 1. Differencing a stationary 
series produces another stationary series. However, the 
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model generated ty tie over differenced series will be mere 
comflicated than a model generated by a stationary series 
obtained with the mirimum amount of differencing [Bef. 10]. 
At this yoint it was decided to continue with the attempt to 
construct a combined model, even though it was realized that 
the possibility was small of oonstructing a combined model 
which would prove superior to the regression model. 
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figure 5.2 DIFfiBEBCEC BESIDOAL AOTOCOBBElATlOIiS. 

lie partial autocorrelations of the differenced residual 
time series are displayed in figure 5.3 . Based on a visual 
inspection of the autocorrelations and partial autocorrela- 
tions a second order autoregressive (AB2) model was 
initially selected for testing. However, the AB2 model 
proved not to be appropriate. Since any moving average <HAg) 
process can be rewritten tc form an AB process, the next 
step was tc test a HA2 model on the differenced time series. 
Ihe tested model had the form: 
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Z,-Zr.,=AzT = a, S- 



(€gD 5.2) 



Shere ly is the residual value from the multiple regression 
model at time T and e^ is the error term for the MA2 model. 
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figure 5.3 PAfillAl AOIOCOBEELATICIiS. 



TA£X£ III 

BESICOAL HOOEI EVAIOATION 



fiesiduals of model: 
Dean: 

Std Dev: 

Significance level 
fcr mean: 

TIPE ESIIMATE 

BA 1 0.6472 

BA 2 0.3C60 



fiaodom 

0.34887 

7.82 

2.00 

SI. DEV. 

0. 1324 
0.1371 



I-BATIC 

4.89 

2.23 



lafle III suggests that the model is appropriate for the 
residual time series. The ABBA model for estimating future 
errors cf the regression model then becomes: 
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= (1 -.^<472 B -.306 B2 ) 



(eqa 5,3) 



Rher€ is the estimated error of the multiple regres- 

sion model at time t and Zy = /\ Z r ♦ 2fi 

vher e : 

Zt* = predicted value of the residual at time t 

/\Z^ = change in the residual value from time t-1 

Zf-j = value of the residual at time t-1 

letting egual the predicted value of the multiple 

regression model aloie, the predicted value of the confined 
model, My , then becomes: 

= Yy ♦ Zy (egn 5.4) 

Hcvever, in exanining eguation 5.3 and taking into 
consideration the standard deviation for the values of 0, 
and 0^ , it is conceivable that eguation 5.3 could be 
factored as: 



^Z^=(l-E) (1+0. 35B) ey (egn 5.5) 

Cancelling cut the differenced values, the eguation feccmes: 
Zy = (U0.35B)eT 



6 2 



(egn 5.6) 



Equation 5.6 indicates that ths time series was over differ- 
enced and suggests csing a MA 1 model on the nondif ferenced 
residual tine series. However, this had previously been 
attempted was found ret to te appropriate. To this pcict in 
the asalysis of the residuals of the regression model, all 
tests ccrducted continue to indicate that the residuals are 
random and that no serial correlation exists. The only test 
remaining was tc cenhine the regression model previously 
constructed with the HA2 model constructed alove and to 
compare the results cf each models forecasts, 

£. 1E£ fOSECASl COBflBlSOHS 

Three models have been developed in order to forecast 
net new contracts signed. The best test of each model would 
have teen tc forecast some menth or series of months subse- 
guent tc September 1S83, the last month of the time series, 
and tc compare the results. Due to the nonavailability of 
appropriate data this was not possible. As an alternative, 
the mcdels were used to forecast the last five months of the 
time series. Bay 19S5 to September 1983, since the results 
were known and the data reguired for forecasting with both 
the multiple regression and the combined model was avail- 
able. The seguential forecasting method was utilized for 
all three models with results as contained in Table IV 

As measured by the root mean squared error, the regres- 
sion model is superior to the ABIBA model and slightly 
superior tc the combined model. If the residuals of the 
multiple regression model were serially correlated, then the 
combined model should have corrected for this problem and 
produced forecasts superior to the multiple regression 
model. The forecast results serve to confirm that the 
multiple regression residuals were not serially correlatsd. 
The superiority cf the regressicn model comes primarily from 
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TlBll 17 

fCBECASl CCHPABISONS 



MONIB 


ACTUAL 


ABIMA 




REGRESS 


CCMEINED 


MAY €3 


59 


60- 1 i 


[+01 .85?] 


63.3 1 


[♦07.35?] 


68.0 


(♦15. 85? 1 


JUN 8^ 


60 


62. S 1 


>04. 85? 


i 63.4 1 


♦ 05.75?' 


59. 8 


(-00.35?' 


JUl 83 


56 


78. C ( 


♦ 39 .35? 


1 67.0 


>19.65? 


68.3 


(♦22. 05?' 


AUG 83 


49 


61.8 1 


♦ 26. 15? 


1 64.6 1 


♦ 31.85? 


1 57.7 


(♦17. 85?' 


SEP 83 


75 


53. C ( 


-29.35? 


1 74.9 1 


-00.15? 


64.8 


(-13. 65? 



SOM CF SCDIEED FfifiCFS: 
AfiliA BOCFL : 1141.46 

EEGFESSICN MODEl: 394.42 

CCMEINEr MODEL : 412.06 



BOCI MEAN EQOABED EfFCES (BMSE); 
ABIHA MODEL : 16.90 

REGFESSICK MODEL: 9.93 

CCMEINED MODEL ; 10.15 



HHEFE BMSE = j^ilr ” ^ 



U - 1) 



= actual value 

A 

Y^ = predicted value 
n =5 (rumber of ctserva ticss) 

the atillt^ to relate sudden shifts in the dependent vari- 
able tc oeasures cf uneaployment as exhibited t; the 
predicted values for Septeaber 1983. The ABIMA process of 
the ccislined model has a dampening tendency on the 

predictions when there are sudden shifts in the variatles. 
The regression process alone would then become the preferred 
model during a period of economic fluctuation because of the 
exhibited superior ability to capture these sudden shifts in 
accessions. 

Tie difficulty in forecasting with the multiple regres- 
sion icdel and the combined model is that one must also 
forecast future values of the change in percent of unemplcy- 
ment, the variable DEITAUN. Even with forecasted values of 
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CELIACN, th€ forecast horizon is limited to one month due to 
the one mccth lag for unemployment, the variable UNEtPiD, 
unless future values for that variable are also forecasted, 
for any period beycrd one month, the forecast for net 
contracts signed then depends on the accuracy of these two 
additicnal forecasts, decreasing the face validity cf the 
predicticn due to coiipounding negative effects. The attrac- 
tiveness of the Eox-Jenkins approach is that only the values 
cf the tine series are taken into consideration in the fore- 
casting process. However, the basic assumption of continuity 
in the pattern of tie data must hold generally true if the 
results are to be of value. 
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VI. CCNCIOSIONS 



i, £EHH1£1 

lir€€ nicdels utilizing t«o distinct analytical methcdcl- 
ogies have teen presented. Ihe Box-Jenkins methodology uas 
applied to a monthly time series o£ net enlistments io the 
ilbuguexgue. New Hexico recruiting district and a todel of 
the underlying data patterns developed. A multiple regres* 
sion Bcdel atteupting to examine causal relationships vas 
derived tased on intuition and the statistical significance 
of the independant variables. A combined time series/causal 
model was developed treating the residuals from the regres- 
sion model as an original time series and applying the 
Eox-Jenkins technique to them. Forecasts were conducted with 
the three models and the results examined. 

A comparison of the root mean squared error of the resi- 
duals of the three models indicated that the multiple 
regression model was superior to both the Box-Jenkins model 
and the combined model. Accuracy favored the multiple 
regression approach, however the need to develop forecasts 
for two principle independent variables limit its* useful- 
ness in forecasting. A need to forecast extended periods 
would favor the Box-Jenkins approach since there does not 
exist a requirement to include the independent variables of 
the multiple regression model. Both the multiple regression 
model and the Box-Jenkins methodology can serve useful 
purposes when the irherent limitations of each are taken 
into consideration. 
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E. IlfllCAIIOiS OP lEE fiESOIIS 



Ih€ aultiple re^iessioxi results suggested three vari- 
ables were primarily respoDsitle for the variaticr. ir. the 
ret Quober cf contracts signed. Ihese were the ordinal value 
of unemf Icyment, the change in percent unemployed and the 
military / civilian fay ratio. Of these three variables, 
the Eefaitaent of Defense (CCD) exerts only a partial influ- 
ence ever the militaiy/civilian pay ratio. As alluded to 
previcusly, no single measure of the military/civilian pay 
ratio car be justified as a "best'* measure of this relation- 
ship. Ibe one presetted in this study was felt tc be an 
adeguate estimation cf how young men making an occupational 
choice view the relationship between military and civilian 
compensation. This ratio did not take into account such 
forms cf compensaticn as medical benefits, enlistment 
bonuses, cemmisary and exchange privileges, special pays, 
and ether benefits. Each individual places different values 
on these additional forms of compensation depending on 
his/her cwn needs and desires. It may be that these ether 
forms cf cempensatior provide the meirginal benefit that sway 
a majority cf enlistees towards an occupation in the mili- 
tary. The effects of these additional forms cf compensation 
have been the focal pcint fer many studies and their effects 
deserve continued research. 

As fer unemployment and the change in percent unem- 
ployed, little can he done by DOD to influence these vari- 
ables. Indeed, it is unreasonable to assume that anycne 
would desire to place the economy of the nation in peril 
simply tc ease recruiting short falls. However, it is clear 
that general economic factors play a pivotal role in the 
cccupaticnal choice process and that the effects of changes 
in the econcmic indicators can have an immediate impact. A 
sudden increase in civilian job market opportunities for the 
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target cebert would have an iasediate and adverse iipact on 
Navy recruiting. During periods oi general economic improve'' 
ment^ mere resources will be reguired to obtain similar 
numbers cf high quality enlistees than during a period of 
general economic decline. 

Eemaining unexplained in the analysis is the shift in 
the time series from apparent seasonality over the first two 
thirds of the series to apparent nonseasonality over the 
last third of the series. Seasonality in recruiting with 
peak months in August and January can most easily he 
explained by the school year. Intuitively, a young man would 
not desire to graduate from high school in June and directly 
enter military service. Or the average, one would expect 
him tc desire to remain among his friends for the Summer 
months, entering the service at the end cf Summer when 
friends either return to school or enter the job market. Ibe 
peak month of January can be intuitively explained by the 
potential recruits desire to remain among family and friends 
during the holiday season prior to entering the service. 
Ehat is not readily explained is the virtual absence of this 
seascrality over the last third of the time series. 
Bemembering that the dependent variable is contracts signed 
and net accessions, what may explain the change in the data 
pattern is a change in the gucta system. Utilizing the CEP 
pool, the potential recruit is currently able to sign a 
contract with the Navy and delay entry into the service for 
a period cf up to cne year. This may have the effect of 
dampening the seasonal pattern exhibited by the first two 
thirds of the time series. However, this is pure conjecture 
as there was insufficient data available to conduct further 
analysis or draw any definite conclusions. 
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C. BiCCBfllNDATICNS fCB FOTOflE STDDI 

Ite scd€ls presented in this study should be refined and 
updated. Ihe 60 data foint time series utilized is tco brief 
to accurately depict cyclic flucuations in the data set. Ihe 
time series should be expanded as additional data feints 
become available. Estimates fxcm models developed from time 
series of adequate length, in excess of 100 data points, 
could pxcve beneficial in allocating quotas and recruiting 
resources. Begressicn analysis, examining causal relation- 
ships on a recruiting district level, should continue in 
order that a better understanding of the effects that local 
economics have on the supply of potential enlistees can be 
obtained. Cne area cf potential intrest may be the applica- 
tion cf leading indicator models, an extension of the 
Eox-Jenkins methodology, to the problem of predictirg the 
supply of recruits. 

Cue tc lack cf sufficient available data, several refin- 
ments in this study which the author feels are important 
could not be made. first, in the area of unemployment, an 
accurate breakdown by sex, age, and ethnic group by 
recruiting district would have been of significant benefit, 
lo make an appreciable improvement in the understanding of 
the interaction of local economic conditions on the supply 
cf potential enlistees this data will be necessary for 
future studies. Some sort of working agreement with the 
Cepartment cf Labor; Bureau cf Labor Statistics would be 
necessary in order tc obtain the data in a useful ferm. A 
restructing of the recruiting districts would make data 
gathering easier. A realignment along state boundries would 
make close coordination with state governments mere readily 
avialable. There could be a cooperative effort between the 
state unemployment office and the recruiting district. 
Although seme states would be reluctant to act as a 
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Cf farticular benefit would be a regional studj enccm- 
fassing accessions into all four branches of the nilitary 
and tie Coast Guard. A study of this type would permit the 
examination of the interaction of the different recruiting 
efforts. Ihis would assist in determining if an increase in 
the recruiting efforts of any one service served to increase 
the tctal number of accessions, or merely drew away pcten- 
tial recruits frcm one of the other services. Additic tally, 
by ccaparing the recicnal economy to total military acces- 
sions for the region, a clearer understanding of the effects 
cf changing economic conditions could be obtained. 
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APilNDIX A 

PfilHEB CN TIBS SEBIES AHALY5I5 



A. IXflE SEBIES ANAIISIS 

£xc€ft Kliere otbciwise noted, this appendix is based on 
the Ncrks of S. Bakiidakis and S.C. Wheelwright. A time 
series is a collection of observations made sequentially 
in tiae. Examples ocour in a variety of fields from physics 
to physiology and methods of analyzing time series consti- 
tute an important area of statistics. Statistical techniques 
for analyzing time series range from the straight forward to 
the very sophisticated. However, traditional means of time 
series analysis are primarily concerned with decomposing a 
series into the basic components of trend, seasonality, 
cyclic changes and other fluctuations. 

Ihe first step in analyzing a time series is to plot the 
observations against time. Figure A. 1 is a time series plot 
of the sales figures for a certain engineering company over 
successive months [ Bef • 11]. This plot can be used to 
obtain simple descriptive measures of the main properties of 
the series. For example, locking at Figure A. 1, it can be 
seen that there is a regular seasonal effect with sales 
*high* ir the winter and *low* in the summer. The graph 
suggests an upward trend in annual sales. For some time 
series, the variation is dominated by such 'obvious* 
features, and a fairly simple model may be perfectly 
adequate to describe the variation in the time series, for 
other time series, more sophisticated techniques will be 
required to provide adequate analysis. 

When observations are taken on two or more variables, it 
may be possible to use the variation in one time series to 
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1000 




figa£€ A.1 Hcnthly Sales. 

explain the variation in another time series. This say lead 
to a deeper understanding of the underlying causal relation- 
ships in a process and permit accurate forecasts of future 
values of the variable of interest. This may in turn aid in 
the cortrol of that future value. 

The ability of a given technigue to forecast effectively 
in a specific situation depends largely on accurately iden- 
tifying the patterns underlying the data and selecting the 
proper technigue to handle them. All time series analysis 
technigues assume that the patterns underlying the data are 
constant over tvc periods; the period over which the data 
was collected, and the period of the forecast. However, 
different technigues are available to fit a wide variety of 
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situaticcs. The factors that need to be considered when 
selecting a model are as follous: 

1. Ibe time horizon for decision maicing 

2. Ihe pattern of tte data 

3. lype ci model desired: tiae series or causal 

4. Ihe value of the forecast 

5. Ihe accuracy that is required 

6. Ihe complexity that can he tolerated 

7. Ihe availability of historical data £Be£. 8] 

£. lEI EOX-JEIKIIS BEIBODOLOGI 

1. Ihe Adva nta ges of Bo x- J enkins 

Ihe Box-Jenbins methodology is an efficient and 
practical procedure for handling time series and other fore- 
casting situations in vhich a variety of complex patterns 
exist. Ihis methodolcgy can handle complex patterns of data 
using a relatively veil specified set of rules that dc not 
require that a model he chosen on the basis of an analyst*s 
experience, intuitive ability or theory. 

Eox-Jenkins is the most general method of 
approaching time-series forecasting. Ihere is no need to 
assume iritially some fixed pattern to date. The approach 
begins hy assuming a tentative pattern that is fitted to the 
data so that the error will be minimized. It additiorally 
provides explicit information to enable the user to judge 
whether the pattern tentatively assumed is correct for the 
situation under study. If so, the forecast can be developed 
directly and, if net, the Eox-Jenkins approach provides 
further clues for identifying the correct pattern. This 
procedure allows the user to arrive at a forecasting model 
that achieves optimization in terms of the basic pattern and 
minimizes the forecasting error. The user is also supplied 
with statistical information on the accuracy of the fore- 
casts . 
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2. Gen eral Apc roac h 

In describing their af^roach/ Box and Jenkins have 
develcfed the scheaatic diagram shown in figure A. 2 
£Se£. 12j. This af^roach divides the problem into three 
stages. In stage one, a specific model is suggested as the 
forecasting method best suited to that situation. In stage 
two, that suggested acdel is fitted to the historical (tine 
series) data and the suggested model is checked to determine 
its adeguacy. l£ the model is not adequate, the analyst 
returns to stage one, and an alternative model is proposed, 
iihen an adequate model is identified, the development of a 
forecast fcr some future time period is conducted in stage 
three . 

3. Ihe Bole of A utoco rr elat ion 

An autocorrelation coefficient is similar to a 
correlation coefficient except that it describes the associ- 
ation among values cf the sane variable but at different 
time periods. The autocorrelation among successive values of 
the data is a key tcci in identifying the most appropriate 
model ccrr esponding to the basic pattern of the data. 
Correlation implies nothing about a change in one variable 
caus ing a change in tbe other variable. 

To further demonstrate, we can construct an artifi- 
cial variable from arcther variable by changing the time 
origin cf the data. for example, dropping the first value 
cf *A* and letting tbe second value be the starting value of 
*B*, 3ee figure A. 3 . In this example variables *A* and *B* 
can be treated as twc separate and distinct variables even 
though they came from the same data set. In this example, 
•B* has a time lac of one period from *A'. This same 
approach can be expanded to create additional data sets 
utilixing as many time lags is as deemed necessary for iden- 
tifying the appropriate model. 
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Figure i.2 fioz-JeuJcins Method. 
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Figure A. 3 lagged fariables. 

We can consider variatles *A* and *B* as two vari- 
ables and calculate tleir ccrrelation coefficients. A coef- 
ficiert cf .80 between *A* and for example, would imply 
that successive values of 'A* with one period (lag) between 
them are positively correlated with each other and tend to 
move in the same direction. Sisilarly, a coefficient of -.70 
between *A* and *B* would indicate that successive values of 
*A* lagged cnce are regatively correlated and tend to meve 
in opposite directiers. However, since variable * E* is 
actually derived froi variable 'A', such an association is 
called attc-(self-) ccrrelation. 

Aut ccorrelaticns provide significant clues about the 
underlying pattern cf a data set. In a set of ccapletely 
random data the autocorrelations among successive values 
will be close to zeio, but data values of strong seasonal 
and/or cyclical character will be highly correlated. 

Figure A. 4 £Se£. 13] for instance, presents the autccorrela- 
tions cf different time lags of monthly temperature in 
Earis. Ihese autocorrelations reveal a strong seasonal 
pattern. 
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It is this tjfs of information gained from autcccr- 
relaticns that can te utilized hy the Bor-Jenkins affioach 
to identify the optinal forecasting model. Of paiticular 
interest is that the analyst needs to know nothing of the 
data set or its patterns to obtain its autocorrelation coef- 
ficients. These coefficients can he used to describe the 
data set and assist in identifying a tentative model to be 
fitted tc tie data. 

Khat Figure A. 4 implies is that temperature of 
months twelve periods apart are highly correlated. Although 
this is basically intuitive, if this were not previously 
known the autocorrelations would be of significant benefit 
in suggesting an appropriate model to describe the data 
pattern £Bef. 13]. 

lox -Jenkin s Model Types 

The Box-Jenkins methodology postulates three general . 
classes of models that can generally describe any type or 
pattern of time series data: 

a. Autoregressive (Afi) 

b. Moving Average <MA) 

c. Mixed Autoregressive-Moving Average (ABMA) 

An autoregressive model (AB) has the form: 



Ir =9,5,., 



(egn A.1} 



where I 7 is the independent variable, high guality recruits, 
and X-r.| # ^r-j are "independent” variables. 

In this case the independent variables are values of the 
same variable, but of previous periods (t- l,t-2, t-3, . . . t-p) . 
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Pinally € is the error or residual term that represects 
raadoB events that are not explained by the model. ihe 
residuals (errors) of the model should be white noise, that 
is they should have a mean of zero, a constant variacce and 
he unccrrelated ever time. 

The model described is autoregressive because it is 
similar tc the regression eguation 



Y=a4h, J, ♦hjXj + t,Xj ♦.. .>b^ +e^ 



(egn A. 2} 



except ■|±iat: 

X, =Xr-. ^ = = 



and thus the independent variables are only lagged values of 
the deperdart variable with time lags of 1, 2, 3,..., p 
periods. Ihe autoregression model is similar in functicc and 
form tc the regression equation. However it is net appro- 
priate tc describe all possible underlying data patterns in 
time series analysis. Thus, the Box-Jenkins approach also 
considers two other classes cf models. 

Ihe Moving Average (HA) model has the form: 



Yr = «T-0.«r-. (eqn A. 3) 

where e^ is the error or residual and e , e^,^ 

,...,€r,^ are previous values of the error. This implies 
that the dependant variable Yi- , depends on previous values 
cf the error term (e,._, rather than on 

the variable itself. The future value of high quality 
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enlist meets could be predicted by utilizing the errer of 
each cf several past predictions. In the same manner that 
autocorrelation among successive values of Y can he exam- 
ined, the autocorrelation among the successive values of the 
residuals can also be examined. 

Ihe third class of model considered by the 
Eox-Jenkins methcdolcgy is a mixed model. The best descrip- 
tion of the pattern of data may be provided by a mixed 
process cf and MA elements. The general mixed mcdel has 
the fexm: 









(egn A. 4) 



It is evident that this model is simply the AR and HA models 
combined. It suggests that future values of high quality 
lecroits depend cn both past values of high quality enlist- 
ments and the errors tetueen forecasts and actual values. 

5. Ide ntif ying a T enta five M odel 

It is possible to suggest some specific ARHA (p,q) 
model by examining the autocorrelation coefficients and a 
similar set of parameters, the partial autocorrelation coef- 
ficients. Partial autccorre laticn coefficients are analcgcus 
to autcccrrela ticn coefficients in that they indicate the 
relationship of the values in a time series to various 
lagged values of the same series. However, they differ frem 
autoccrxelations in that they are computed for each time lag 
after removing the effect of all other time lags in the 
given time lag and on the original series. In essence, they 
show the relationship that exists for varying time lags 
[fief. 8]. 
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Ccmtinations cf aut ccorrelation and partial autcccr- 
relaticn coefficients, in the absence of randomness, can 
reveal the exact ABMA model. Randomness complicates the task 
because it influences the values of the autocorrelation 
coefficients and may cause them to deviate from their "true" 
pattern. Ihis makes acdel identification more difficult, tut 
generally there is eicugh information available to select an 
appropriate model. £ven if there is a mistake at this point 
in the analysis, it kill be found when a test of the acdel 
is conducted. Figures A. 5 through A. 9 demonstrate the 
tehavicr cf the theoretical autocorrelations (ACs) and 
partial autocorrelations (PACs) for each model, where "HA" 
denotes a moving average model, "SHA" denotes a seasonal 
moving average model, "AR" denotes an autoregressive model 
and *'SAR" denotes a seasonal autoregressive mcdel. 
£Ref. 10] 



Thus in order to forecast a time series, the auto- . 
correlations and partial autocorrelations must be computed 
first. Scaetimes tie pattern of the computed autocorrela- 
tions and partial autocorrelations can be easily classified 
as one of the theoretical ones described below. The identi- 
fication cf an appropriate model is then relatively easy . 
ether times, some free association and intuitive judgement 
is reguired to be able to infer a pattern from the autcccr- 
relaticns, or mere than one pattern may be suggested. In 
such instances, a choice can be made based on goodness of 
fit and nuaber cf parameters involved. Once a tentative 
model has heen made, the next step is to estimate the param- 
eters. 



^ • Est i mat ing the Para a et er s 

Assuming that the tentative model is an ARHA (1,1) 
its' mathematical form is 
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^ = ®*r.. 



(eqn A. 5) 




r-i 



in order tc utilize ttis model, the value of (D and 0 must 
te estimated. Ihe general apfroach is to start with seme 
estimated initial values for Q) and 0 and then modify them by 
small steps while observing the sum of the squared errors 
<5S£) . Ihis allows the direction of change in (Z> and 0 that 
result in the siallest SSE to be determined. Eventually, 
the O and 0 corresponding to the minimum SSE are found and 
used as the final estimates for the model. The next step is 
to test the adequacy cf the model. 

7 . Tes ting the hcdel^s Adeq uacy 

If the model is adequate, the residuals (differ- 
ences) between the time series values and those estimated by 
the model must be random (white noise). The autocorrelation 
coefficients for these residuals are examined. If none of 
the autocorrelation coefficients of the residuals are 
significantly different from zero (plus or minus two stan- 
dard errors), the errors are assumed to be white noise and 
the model is adequate. If the autocorrelations are not 
random, the analyst must return to step two and select 
another model. 
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Figure A. 5 AR (1) flodel ACs and FACs. 
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PartiaJ Autocorrelations 



AR ( 2 ) 
( t ), = -^0 6 
(b,= + 0 2 









AR(2) 
tl), = -0.8 
(J) . = - 0.6 





Figure A. 6 AB (2) Bodel ACs and PACs. 
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Figure A. 7 HA (1) flodel ACs and PACs. 
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Autocorrelations 



Partial Autocorrelations 










Figure A. 8 HA (2) Bodel ACs and PACs. 
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Hgorc A.9 Biicd Araa (1/1) Model ACs and PACs 



p. 



Aulocorrtrlaiions 



Span= 12 




Partial Autocorrelations 










Figure 1.10 Afi(1) 1 SAfi(1) Seasonal Bodel ICs and FlCs. 
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Span = 1 2 
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Partial Autocorrelations 










Figure 1-11 AB(2) 1 SAB(1) Seasonal Hodel ACs and PACs. 
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Autocorrelations 



ParuaJ Autocorrelations 
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Figuie i. 12 MA(1) 1 SHA(1) Seasonal Sodel ACs and SICs. 
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Figate 1.13 HA(2) I SHA(l) Seasonal Model ACs and fACs 
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TIME SERIES PLOTS OF THE VARIABLES 
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Figure B.5 PAXIAIO 
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